NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Towards Scientific Discovery with Generative AI: Progress, Opportunities, and Challenges

https://doi.org/10.1609/aaai.v39i27.35084

Reddy, Chandan K; Shojaee, Parshin (April 2025, Proceedings of the AAAI Conference on Artificial Intelligence)

Scientific discovery is a complex cognitive process that has driven human knowledge and technological progress for centuries. While artificial intelligence (AI) has made significant advances in automating aspects of scientific reasoning, simulation, and experimentation, we still lack integrated AI systems capable of performing autonomous long-term scientific research and discovery. This paper examines the current state of AI for scientific discovery, highlighting recent progress in large language models and other AI techniques applied to scientific tasks. We then outline key challenges and promising research directions toward developing more comprehensive AI systems for scientific discovery, including the need for science-focused AI agents, improved benchmarks and evaluation metrics, multimodal scientific representations, and unified frameworks combining reasoning, theorem proving, and data-driven modeling. Addressing these challenges could lead to transformative AI tools to accelerate progress across disciplines towards scientific discovery.
more » « less
Full Text Available
LLM-SR: Scientific Equation Discovery via Programming with Large Language Models

Shojaee, Parshin; Meidani, Kazem; Gupta, Shashank; Farimani, Amir; Reddy, Chandan K (April 2025, ICLR)

Full Text Available
Evolutionary Large Language Model for Automated Feature Transformation

https://doi.org/10.1609/aaai.v39i16.33851

Gong, Nanxu; Reddy, Chandan K; Ying, Wangyang; Chen, Haifeng; Fu, Yanjie (April 2025, Proceedings of the AAAI Conference on Artificial Intelligence)

Feature transformation aims to reconstruct the feature space of raw features to enhance the performance of downstream models. However, the exponential growth in the combinations of features and operations poses a challenge, making it difficult for existing methods to efficiently explore a wide space. Additionally, their optimization is solely driven by the accuracy of downstream models in specific domains, neglecting the acquisition of general feature knowledge. To fill this research gap, we propose an evolutionary LLM framework for automated feature transformation. This framework consists of two parts: 1) constructing a multi-population database through an RL data collector while utilizing evolutionary algorithm strategies for database maintenance, and 2) utilizing the ability of Large Language Model (LLM) in sequence understanding, we employ few-shot prompts to guide LLM in generating superior samples based on feature transformation sequence distinction. Leveraging the multi-population database initially provides a wide search scope to discover excellent populations. Through culling and evolution, high-quality populations are given greater opportunities, thereby furthering the pursuit of optimal individuals. By integrating LLMs with evolutionary algorithms, we achieve efficient exploration within a vast space, while harnessing feature knowledge to propel optimization, thus realizing a more adaptable search paradigm. Finally, we empirically demonstrate the effectiveness and generality of our proposed method.
more » « less
Full Text Available
H-STAR: LLM-driven Hybrid SQL-Text Adaptive Reasoning on Tables

https://doi.org/10.18653/v1/2025.naacl-long.445

Abhyankar, Nikhil; Gupta, Vivek; Roth, Dan; Reddy, Chandan K (January 2025, Association for Computational Linguistics)

Full Text Available
Sycophancy Mitigation Through Reinforcement Learning with Uncertainty-Aware Adaptive Reasoning Trajectories

https://doi.org/10.18653/v1/2025.emnlp-main.661

Beigi, Mohammad; Shen, Ying; Shojaee, Parshin; Wang, Qifan; Wang, Zichao; Reddy, Chandan K; Jin, Ming; Huang, Lifu (January 2025, Association for Computational Linguistics)

Full Text Available
Discovery of generalizable TBI phenotypes using multivariate time-series clustering

https://doi.org/10.1016/j.compbiomed.2024.108997

Ghaderi, Hamid; Foreman, Brandon; Reddy, Chandan K; Subbian, Vignesh (September 2024, Computers in Biology and Medicine)

Full Text Available
Identifying TBI Physiological States by Clustering Multivariate Clinical Time-Series Data

Ghaderi, Hamid; Foreman, Brandon; Nayebi, Amin; Reddy, Chandan K; Subbian, Vignesh (January 2024, AMIA Annual Symposium Proceedings)

Full Text Available
Multi-Label Clinical Time-Series Generation via Conditional GAN

https://doi.org/10.1109/TKDE.2023.3310909

Lu, Chang; Reddy, Chandan K.; Wang, Ping; Nie, Dong; Ning, Yue (August 2023, IEEE Transactions on Knowledge and Data Engineering)

Full Text Available
WindowSHAP: An efficient framework for explaining time-series classifiers based on Shapley values

https://doi.org/10.1016/j.jbi.2023.104438

Nayebi, Amin; Tipirneni, Sindhu; Reddy, Chandan K.; Foreman, Brandon; Subbian, Vignesh (August 2023, Journal of Biomedical Informatics)

Full Text Available
Self-Supervised Transformer for Sparse and Irregularly Sampled Multivariate Clinical Time-Series

https://doi.org/10.1145/3516367

Tipirneni, Sindhu; Reddy, Chandan K. (December 2022, ACM Transactions on Knowledge Discovery from Data)

Multivariate time-series data are frequently observed in critical care settings and are typically characterized by sparsity (missing information) and irregular time intervals. Existing approaches for learning representations in this domain handle these challenges by either aggregation or imputation of values, which in-turn suppresses the fine-grained information and adds undesirable noise/overhead into the machine learning model. To tackle this problem, we propose a S elf-supervised Tra nsformer for T ime- S eries (STraTS) model, which overcomes these pitfalls by treating time-series as a set of observation triplets instead of using the standard dense matrix representation. It employs a novel Continuous Value Embedding technique to encode continuous time and variable values without the need for discretization. It is composed of a Transformer component with multi-head attention layers, which enable it to learn contextual triplet embeddings while avoiding the problems of recurrence and vanishing gradients that occur in recurrent architectures. In addition, to tackle the problem of limited availability of labeled data (which is typically observed in many healthcare applications), STraTS utilizes self-supervision by leveraging unlabeled data to learn better representations by using time-series forecasting as an auxiliary proxy task. Experiments on real-world multivariate clinical time-series benchmark datasets demonstrate that STraTS has better prediction performance than state-of-the-art methods for mortality prediction, especially when labeled data is limited. Finally, we also present an interpretable version of STraTS, which can identify important measurements in the time-series data. Our data preprocessing and model implementation codes are available at https://github.com/sindhura97/STraTS .
more » « less
Full Text Available

« Prev Next »

Search for: All records